Attorney Docket No. 
016295.0693 



U.S. PATENT APPLICATION 



COMPUTER SYSTEM WITH IMPROVED ERROR DETECTION 

FIELD OF THE INVENTION 

5 The invention relates generally ito computer systems 

and in particular to modules having a non-volatile memory 
within computer systems and to computer systems with memory 
modules having a non-volatile memory section. More 
particularly, the invention relates to techniques for 
10 . retrieving information about the failure of a module. 

BACKGROUND OF THE INVENTION 

Computer memory comes in two basic forms: Random 
15 Access Memory (hereinafter RAM) and Read-Only Memory 
(hereinafter ROM) . RAM is generally used by a processor 
for reading and writing data. RAM memory is volatile 
typically, meaning that the data stored in the memory is 
lost when power is removed. ROM is generally used for 
20 storing data which will never change, such as the Basic 
Input/Output System (hereinafter BIOS) . ROM memory is non- 
volatile typically, meaning that the data stored in the 
memory is not lost even if power is removed from the 
memory . 

25 

Generally, RAM makes up the bulk of the computer 
system's memory, excluding the computer system 1 s hard- 
drive, if one exists. RAM typically comes in the form of 
dynamic RAM (hereinafter DRAM) which requires frequent 
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recharging or refreshing to preserve its contents. 
Organizationally, data is typically arranged in bytes of 8 
data bits. An optional 9th bit, a parity bit, acts as a 
check on the correctness of the values of the other eight 
5 bits. 

As computer systems become more advanced, there is an 
ever increasing demand for DRAM memory capacity. 
Consequently, DRAM memory is available in module form, in 

10 which a plurality of memory chips are placed on a small 
circuit card, which card then plugs into a memory socket 
connected to the computer motherboard or memory carrier 
card. Examples of commercial memory modules are SIMMs 
(Single In-line Memory Modules) and DIMMs (Dual In-line 

15 Memory Modules) . 

In addition to an ever increasing demand for DRAM 
capacity, different computer systems may also require 
different memory operating modes. Present memories are 

20 designed with different modes and operational features such 
as fast page mode (FPM) , extended data out (EDO) , 
synchronous DRAM (SDRAM) , double data rate SDRAM (DDR 
SDRAM) , parity and non-parity, error correcting (ECC) and 
non error correcting, to name a few. Memories also are 

25 produced with a variety of performance characteristics such 
as access speeds , refresh times and so on. Further still , 
a wide variety of basic memory architectures are available 
with different device organizations, addressing 
requirements and logical banks. 
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In order to address some of the problems associated 
with the wide variety of memory chip performance, 
operational characteristics and compatibility with system 
5 requirements, memory modules are being provided with 
presence detect (PD) data. PD data is stored in a non- 
volatile memory such as an EE PROM on the memory module. A 
typical PD data structure includes 256 eight bit bytes of 
information. Bytes 0 through 127 are generally locked by 

10 the manufacturer, while bytes 128 through 255 are available 
for system use. Bytes 0-35 are intended to provide an in- 
depth summary of the memory module architecture, allowable 
functions and important timing information. PD data can be 
read in parallel or series form, but serial PD (SPD) is 

15 already commonly in use. SPD data is serially accessed by 
the system memory controller during boot up across a 
standard serial bus such as an 1.2 C™bus (referred to 
hereinafter as an I 2 C controller) . The system controller 
then determines whether the memory module is compatible 

20 with the system requirements and if it is will complete a 
normal boot. If the module is not compatible an error 
message may be issued or other action taken. 

Other modules within the system can provide similar 
25 configuration means in for of an integrated EEPROM. In 
particular laptop computers are built modular. Each module 
can have such a non-volatile memory to store module 
specific configuration data. 
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As memory modules form the main memory in a computer 
system their proper function is most crucial within the 
system. However, even with the latest technology it is not 
always guaranteed that a memory will have no defects. Some 
5 malfunctioning of a memory module can be related to 
external components, some errors might be generated within 
the module. Usually whenever the memory module is 

malfunctioning a major system error such as a system crash 
will take place. If the error can be reproduced the user 

10 usually contacts his service person and/or brings the 
computer to a service technician for repair. By telling 
the service person about the failure he might be able to 
identify the problem and exchange the respective 
malfunctioning part of the system. However, sometimes an 

15 error cannot be reproduced. 

In yet another scenario, only the defective memory 

module is sent in or brought to a technician. The 

technician often just labels the module and sends it to a 

20 manufacturer for repair. In either case, information can 

get lost or can be missed. The whole process is rather 
cumbersome . 

25 
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Summary of the Invention 

Therefore, a need for an improved computer system 
exists. In particular a need for an improved handling of 
5 modules, in particular memory modules, within a computer 
system exists. One exemplary embodiment of the present 
invention comprises a method of operating a computer system 
with a central processing unit and a memory system coupled 
to the central processing system. The memory system 
10 comprises a plurality of memory module slots for receiving 
of memory modules. Each memory module comprises a random 
access memory section and a non-volatile memory section. 
The method comprises the steps of: 

- detecting a memory error; 

- analyzing the memory error, determining a memory 
module in which the error occurred and creating a log; and 

- storing the log in the non-volatile memory section 
of the memory module. 

Another exemplary embodiment according to the present 
invention is a method of operating a system module 
comprising a non-volatile memory section. The method 
comprising the steps of: 

- detecting an error; 

- analyzing said error and creating a log; and 

- storing said log in said non-volatile memory section 
of said system module. 
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The module or memory error can be detected during a 
diagnostic test or during normal operation. The log can 
comprise information about the error type, the location of 
5 the memory module such as the slot number, the date and 
time when the error occurred, and/or the system 
identification. The log can be stored in a cyclical 
manner, such that the most recent error are accessible in 
the manner like a flight recorder works. In other words, 

10 the oldest information stored in the system or memory 
module will be overwritten first by new incoming data. 

A computer system according an exemplary embodiment of 
the present invention comprises a central processing unit, 
a memory system coupled with the central processing unit 

15 comprising a plurality of memory module slots for receiving 
of memory modules. The memory module comprises a random 
access memory section and a non-volatile memory section. 
Furthermore means for detecting an error in the memory 
system, means for generating a log about the error, and 

20 means for storing the log in the non-volatile memory 
section of a memory module are provided. The means for 
detecting an error can be an interrupt unit generating 
respective exception or trap vectors if a memory access 
fails. The Means for generating and storing the log can be 

25 respective BIOS routines programmed for the respective 
central processing unit. 

Yet another exemplary embodiment is a computer system 
comprising a central processing unit, at least one system 
module coupled with the central processing unit comprising 
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a non-volatile memory section, means for detecting an error 
in the system module, means for generating a log about the 
error, and means for storing the log in the non-volatile 
memory section of the system module. 

The non-volatile memory can be divided in a plurality 
of sub sections each sub section storing one log* The sub 
sections can be preferably written in a cyclical manner. 
Again, the log can comprise information about the error 
type, the location of the memory module, the date and time 
when said error occurred and information about the system 
ident i f i cat ion . 
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Brief Description of the Drawings 

A more complete understanding of the present 
disclosure and advantages thereof may be acquired by 
5 referring to the following description taken in conjunction 
with the accompanying drawings, in which like reference 
numbers indicate like features, and wherein: 

Figure 1 is a block diagram of a personal computer 
system according to the present invention; 
10 Figure 2 is a diagram showing a memory module usable 

for a system according to the present invention; 

Figure 3 is a flow chart according to an exemplary 
embodiment of the present invention; 

Figure 4 shows handling sequence after detection of an 
15 error according to an exemplary embodiment of the present 
i nve n t i on ; and 

Figure 5 shows another handling sequence for a memory 
module according to an exemplary embodiment of the present 
invention, 

20 
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Detailed Description of the Preferred Embodiments 

Turning to the drawings, exemplary embodiments of the 
present application will now be described. Figure 1 shows a 
5 block diagram of a portable computer system 100, such as a 
laptop computer. The system 100 comprises a central 
processing unit 180 (CPU) as its central element. Connected 
to the CPU 180 is an internal bus 110 for coupling of 
peripheral elements. One or more of these peripherals is 

10 usually a chip set 12 0 for interfacing the memory system and 
extension cards, such as, PCI-, PCIX-, ISA-Bus compatible 
cards. Therefore, the chip set 120 provides interfaces, for 
example, to a PCI bus 130 and an ISA bus 140. To couple the 
CPU with a memory system 150, the chip set 120 provides a 

15 memory bus 160 and a control bus 170. The memory system can 
consist of a plurality of slots in which a user can plug in 
memory modules, such as DIMMs, SIMMs, etc. In this scenario 
the chip set 120 provides the necessary memory controller 
unit. In another embodiment, memory system 150 includes a 

20 memory controller which generates all necessary signals 
provided to the respective memory slots receiving one or more 
memory module. 

As mentioned above, a memory module comprises the actual 
dynamic random access memory (DRAM) as well as a small non- 
25 volatile memory area. In another embodiment a system module, 
for example, a hard drive sub system can comprise a small 
non-volatile memory area which is mainly used for 
configuration purposes similar to the memory module described 
above. The above mentioned memory module is shown as such a 
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system module in more detail in figure 2. A system module 
200 is shown in form of a memory module which is divided into 
a main section 210 containing the actual DRAM and a non- 
volatile section 220, 230. Typical sizes of this DRAM area 
5 are 64 Mbytes, 128 Mbytes, 256 Mbytes, 512 Mbytes, etc. The 
non-volatile memory area consists of two electrical erasable 
programmable read only memory sections (EEPROM) 220 and 230. 
Memory module 2 00 is coupled through a bus 250 with a memory 
controller 240 which can be part of the memory system or the 

10 chip set 12 0 according to figure 1. Non- volatile memory bank 
22 0 usually contains configuration information about the 
respective memory module or the respective system module. 
Bank 220 comprises 128 data bytes. The information contained 
in bank 220 and bank 230 for a memory module is shown in 

15 Table 1. 



TABLE 1 



BANK 22 0 
20 BYTE NOS. DATA 

0-35 Module functional and performance information 

36-61 Superset data 

62 SPD Revision 

63 Checksum for bytes 0-62 

25 64-127 Manufacturer's information 



BANK 230 

BYTE NOS. DATA 

128-255 Reserved for system use 
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The PD data in bytes 0-35 can be used by a system 
controller to verify compatibility of the memory module 20 
and the system requirements. The PD data can be read in 
5 serial or parallel format. Although serial PD data (SPD) is 
used in the exemplary embodiments herein, those skilled in 
the art will appreciate that the invention can be used with 
parallel PD data. 

The information contained in bytes 0-127 is generally 
10 locked by the manufacturer after completion of the module 
□ build and test. This ensures that the data is not corrupted 

"™ or overwritten at a later time. 

m In a system according to the prior art, bank 23 0 is 

S usually not used for any purposes. Up to now, any 

15 malfunction of a computer system 100 causes either a 
s respective error message on the screen or even worse will 

% results in a freeze of the system, such that the only remedy 

is a reset. However, whenever a module, such as the memory 
jS system malfunctions, usually one of the memory modules or the 

^ 20 memory controller is defect. Such a defect is usually 
detected by the system software, for example, the basic input 
output system software (BIOS) . Respective error messages 
which are more or less descriptive will then be displayed to 
a user. In case of a descriptive message the user might be 
25 able to identify the problem and, for example, replace the 
defect system module. However, in many cases, in particular 
in case of a defect memory module, the malfunctioning module 
will be sent to the manufacturer without any additional 
information, for example, the information which was displayed 
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on the screen of the respective malfunctioning computer 
system. 

According to the present invention this information will 
be written into the unused memory bank 230 of the respective 
5 malfunctioning system module 200. The information may 
contain any type of useful information so that a technician 
will be able to later reconstruct what has happened in the 
malfunctioning system. For example, the information can 
contain some computer type information, the error type, the 

10 slot number in which the malfunctioning memory module was 
located at that time, and the date and time. Any type of 
memory failure information can be written into this memory 
bank 23 0, for example, in cyclical log form. The host 
computer 100 has access to this log to create, update or read 

15 the information via BIOS commands. 

Thus, each individual failed memory module will now have 
individual log information that is part of the hardware. The 
failure information and condition will stay internally with 
the module permanently until it is erased or overwritten by 

20 the host computer 100, a tester or a device that can access 
to the non-volatile memory bank 230. The host system 100 can 
now use the log information to verify the condition of each 
memoir/ module within each start-up routine or during a test 
routine. In addition, the memory module manufacturer now can 

25 use the log in complement with existing tagging systems to 
study the respective failure mode. 

With this new concept, a computer manufacturer has the 
advantage of time reduction during trouble shooting and 
replacement of failed memory modules and a better way to 
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document the failure on the manufacturing line. 



In the 



field, this method will help to reduce the number of 
unnecessary dispatches, a better diagnostic tool and a 
complement to the existing way to document failure at the 
5 customer site. 

As can be readily seen by someone skilled in the art 
this method is not limited to memory modules but can be used 
with any other system module having a non-volatile memory 
section which is unused, such as a configuration memory. 
10 Figure 3 shows a flow chart diagram of how the log 

n information is written into the non-volatile memory bank. 

^ This routine can be implemented as an exception routine. A 

HI memory failure in any memory module, for example, can 

S generate an interrupt or trap which interrupts the execution 

*j£ 15 of the current instruction sequence and branches to start 



point 3 00 . The generation of such an exception is usually 
done as follows. The CPU 180 of system 100 tries to access a 
specific memory location within one of the memory modules 



20 possible due to the malfunctioning, the CPU has an assigned 
trap or exception vector for such a memory access . The BIOS 
comprises a respective routine for this exception vector. In 
this routine the error can be documented for further use of 
the system software. For example, this routine can store the 

25 exact address that has been used, the data that has been 
tried to store, the last program counter from the stack, etc. 
Furthermore, the slot number of the respective memory module, 
and date and time the error occurred can be documented. In 
step 310 the routine gathers this information about the 



which is assumed to malfunction. 



As an access is not 
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current malfunctioning. For example, the BIOS can provide a 
respective routine to read the specific part in the DRAM of 
the computer system 100 that contains the above mentioned 
information. In step 320 this information is decoded and 
5 transformed into the respective log information. For 
example, the stored address of the malfunctioning memory cell 
is used to determine the memory module containing the 
address. In addition, information about the computer, such 
as the CPU, model, production year etc. can be retrieved from 

10 the computer system. The transformed log information is then 
stored into memory bank 230 in step 330. To this end, in a 
first step the content of memory bank 23 0 is erased applying 
respective control signals to bank 230 of the EEPROM. In a 
second step the actual data is written into the bank 230 

15 using appropriate control signals. 

Depending on the size of each information log, either 
the whole bank 230 or only parts of it are used. To 
implement a cyclical log form the following procedure will be 
used. If, for example, 64 bits are used to document any type 

20 of error, always to consecutive error logs can be stored in 
memory bank 230. To this end, addresses 128-191 are used for 
a first log and addresses 192-255 are used for a second log. 
A following third log will erase and replace the first log 
and a fourth log will erase the second log and so on. If 

25 less information is stored within a log more logs can be 
permanently stored with this method according to the above 
described principle. 

Figure 4 shows a diagram of another embodiment according 
to the present invention. Box 400 indicates that an error 
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has been detected during a diagnostic test of the computer 
system, for example, during a start-up routine. This error 
message is sent to the system BIOS 420, The second box 410 
indicates that an error during normal operation has been 
5 detected by the chip set 120. Again, this error message is 
sent of system BIOS 420. System BIOS 420 then generates a 
log entry in the upper part 230 of the EEPROM of the memory 
module 200. The stored information can be, for example: 



10 TABLE II 

- The system ID (service tag) 

- The error type (read error, write error, refresh 
error, etc.) 

- The SLOT ID (location) 
15 - Date and time 



Again, as described above more or less information can 
be generated and used to document the respective error. Each 
information is preferably coded to save memory space. For 

20 example, 8 bit can be used to define the error type. Thus, 
256 different error types can be coded. 

Figure 5 shows a diagram for the read back routine. Box 
52 0 contains the read error log routine initiated by system 
BIOS 510 which reads the respective memory module to read the 

25 information of Table II as described above. System BIOS 510 
sends this information, for example to a routine 500 for 
displaying the error log on screen or record it on a specific 
file of a analyzing system. 
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Again, the above described method and the arrangement 
were described showing a computer system with memory modules 
having non-volatile configuration memory. However, any type 
of system module having a non-volatile memory section, for 
5 example, for configuration purposes, can be easily adapted to 
use within the scope of the present invention. For example, 
peripheral cards such as network, modem, disk controller 
etc., or devices such as power supply, monitor, processor 
and so on can comprise non-volatile memory sections which 
10 have an unused data section. Access to these system 
p components /modules usually is similar to the access to the 

v 5 memory system and can produce similar data, in particular 

U? similar error data if the respective module is 

□ malfunctioning. Using the same principle as described 
[Jf 15 above, provides significant advantages to a computer 
s manufacturer in locating the respective defect. 

Furthermore, statistical data can be collected which help 
M s to eliminate any type of weakness in the production which 

□ eventually might lead to a respective defect in such a 
20 module. 

The invention, therefore, is well adapted to carry out 
the objects and attain the ends and advantages mentioned, as 
well as others inherent therein. While the invention has 
been depicted, described, and is defined by reference to 
25 exemplary embodiments of the invention, such references do 
not imply a limitation on the invention, and no such 
limitation is to be inferred. The invention is capable of 
considerable modification, alternation, and equivalents in 
form and function, as will occur to those ordinarily skilled 
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in the pertinent arts and having the benefit of this 
disclosure. The depicted and described embodiments of the 
invention are exemplary only, and are not exhaustive of the 
scope of the invention. Consequently, the invention is 
5 intended to be limited only by the spirit and scope of the 
appended claims, giving full cognizance to equivalents in all 
respects . 
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